Communication and Streaming Complexity of Approximate Pattern Matching
Abstract
We consider the approximate pattern matching problem. Given a text T of length 2n and a pattern P of length n, the task is to decide for each prefix T[1, j] of T whether it ends with a string that is at edit distance at most k from P. If this is the case, we must output the edit distance and the corresponding edit operations. We first look at the communication complexity of the problem. We show the following: If Alice and Bob both share the pattern and Alice holds the first half of the text and Bob the second half, then the deterministic one-way communication complexity of the problem is Θ(k log n). If Alice holds the first half of the text, Bob the second half of the text, and Charlie the pattern, then there is a deterministic one-way communication protocol that uses O(k √n log n) bits. We then develop the first sublinear-space streaming algorithm for the problem. There exists a streaming algorithm that solves the problem in O(k^8 √n log^6 n) space. The worst-case time complexity of the algorithm is O((k^2 √n + k^13) · log^4 n) per arrival. The algorithm is randomised with error probability at most 1/poly(n).
1998 ACM Subject Classification: F.2 Analysis of Algorithms and Problem Complexity
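To make the problem statement in the abstract concrete, the following is a minimal sketch of the classical dynamic-programming baseline (often attributed to Sellers), not the streaming algorithm of the paper. It stores the whole pattern and one column of the table, so it uses space linear in the pattern length rather than sublinear space, and it reports only the edit distance, not the edit operations. The function name `approx_match_ends` and its return format are assumptions chosen for illustration.

```python
def approx_match_ends(text: str, pattern: str, k: int) -> list[tuple[int, int]]:
    """For each 1-based end position j, report (j, d) when some suffix of
    text[:j] is at edit distance d <= k from pattern (d is the minimum)."""
    m = len(pattern)
    # col[i] = minimum edit distance between pattern[:i] and any suffix
    # of the text read so far. Before reading any text, col[i] = i.
    col = list(range(m + 1))
    hits = []
    for j, ch in enumerate(text, start=1):
        prev_diag = col[0]  # table entry for (i - 1, j - 1)
        col[0] = 0          # an occurrence may start at any text position
        for i in range(1, m + 1):
            old = col[i]    # table entry for (i, j - 1)
            col[i] = min(
                prev_diag + (pattern[i - 1] != ch),  # match or substitution
                old + 1,                             # extra character in the text
                col[i - 1] + 1,                      # pattern character missing from the text
            )
            prev_diag = old
        if col[m] <= k:
            hits.append((j, col[m]))
    return hits
```

For example, `approx_match_ends("abcdefg", "cde", 1)` returns `[(4, 1), (5, 0), (6, 1)]`: the prefix ending at position 5 ends with an exact occurrence, and the prefixes ending at positions 4 and 6 end with strings one edit away from the pattern.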
Similar resources
Maximum Matchings in Dynamic Graph Streams and the Simultaneous Communication Model
We study the problem of finding an approximate maximum matching in two closely related computational models, namely, the dynamic graph streaming model and the simultaneous multi-party communication model. In the dynamic graph streaming model, the input graph is revealed as a stream of edge insertions and deletions, and the goal is to design a small space algorithm to approximate the maximum mat...
On the communication and streaming complexity of maximum bipartite matching
Consider the following communication problem. Alice holds a graph G_A = (P, Q, E_A) and Bob holds a graph G_B = (P, Q, E_B), where |P| = |Q| = n. Alice is allowed to send Bob a message m that depends only on the graph G_A. Bob must then output a matching M ⊆ E_A ∪ E_B. What is the minimum message size of the message m that Alice sends to Bob that allows Bob to recover a matching of size at least (1 − ε) t...
The streaming k-mismatch problem
We consider the streaming complexity of a fundamental task in approximate pattern matching: the k-mismatch problem. It asks to compute Hamming distances between a pattern of length n and all length-n substrings of a text for which the Hamming distance does not exceed a given threshold k. In our problem formulation, we report not only the Hamming distance but also, on demand, the full mismatch i...
Approximate Hamming Distance in a Stream
We consider the problem of computing a (1+ε)-approximation of the Hamming distance between a pattern of length n and successive substrings of a stream. We first look at the one-way randomised communication complexity of this problem. We show the following: If Alice and Bob both share the pattern and Alice has the first half of the stream and Bob the second half, then there is an O(ε^−4 log^2 n) b...
Streaming Communication Protocols
We define the Streaming Communication model that combines the main aspects of communication complexity and streaming. We consider two agents that want to compute some function that depends on inputs that are distributed to each agent. The inputs arrive as data streams and each agent has a bounded memory. Agents are allowed to communicate with each other and also update their memory based on the...
Publication date: 2017